Better Data From Better Measurements Using Computerized Adaptive Testing
Abstract
The process of constructing a fixed-length conventional test frequently focuses on maximizing internal consistency reliability by selecting test items that are of average difficulty and high discrimination (a “peaked” test). Viewed from the perspective of item response theory (IRT), the effect of constructing such a test is test scores that are precise for examinees whose trait levels are near the point at which the test is peaked; as examinee trait levels deviate from the mean, the precision of their scores decreases substantially. Results of a small simulation study demonstrate that when peaked tests are “off target” for an examinee, the resulting scores are biased and have spuriously high standard deviations, reflecting substantial amounts of error. These errors can reduce the correlations of such scores with other variables and adversely affect the results of standard statistical tests. By contrast, scores from adaptive tests are essentially unbiased and have standard deviations that are much closer to true values. Basic concepts of adaptive testing are introduced, and fully adaptive computerized tests (CATs) based on IRT are described. Several examples of response records from CATs are discussed to illustrate how CATs function. Some operational issues, including item exposure, content balancing, and enemy items, are also briefly discussed. It is concluded that because a CAT constructs a unique test for each examinee, scores from CATs will be more precise and should provide better data for social science research and applications.